A Transformer Based Pitch Sequence Autoencoder with MIDI Augmentation

Abstract

Despite recent achievements in deep learning algorithms for automatic music generation, few approaches have been proposed to evaluate whether a single-track excerpt was composed by automatons or by Homo sapiens. To tackle this problem, we apply a masked language model based on ALBERT to composer classification. The aim is to obtain a model that can suggest the probability that a MIDI clip was composed, conditioned on the auto-generation hypothesis, and which is trained with only AI-composed MIDI. In this paper, the number of parameters is reduced, and two methods of data augmentation are proposed, as well as a refined loss function, to prevent overfitting. The experiment results show that our model ranks 3rd among all 7 teams in the data challenge of CSMT (2020). Furthermore, this method could be spread to other music information retrieval tasks that are based on a small dataset.
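
As a rough illustration of the two ideas named in the abstract, the sketch below shows how a pitch sequence might be masked for masked-language-model training and how a clip might be augmented by transposition. This is not the authors' code. The abstract does not say which two augmentation methods were used, so transposition is an assumed stand-in, and the names `MASK_TOKEN`, `IGNORE`, `mask_pitch_sequence`, and `transpose` are hypothetical.

```python
import random

MASK_TOKEN = 128  # hypothetical token id, one past the MIDI pitch range 0..127
IGNORE = -100     # hypothetical label for positions that are not scored


def mask_pitch_sequence(pitches, mask_prob=0.15, seed=0):
    """Return (inputs, labels) for masked-language-model style training.

    Each pitch is replaced by MASK_TOKEN with probability mask_prob.
    The label keeps the original pitch only at masked positions.
    """
    rng = random.Random(seed)
    inputs, labels = [], []
    for pitch in pitches:
        if rng.random() < mask_prob:
            inputs.append(MASK_TOKEN)
            labels.append(pitch)
        else:
            inputs.append(pitch)
            labels.append(IGNORE)
    return inputs, labels


def transpose(pitches, semitones):
    """Shift every pitch by `semitones` (assumed augmentation, not from the paper).

    Returns None if any shifted note would leave the MIDI range 0..127.
    """
    shifted = [p + semitones for p in pitches]
    if all(0 <= p <= 127 for p in shifted):
        return shifted
    return None


if __name__ == "__main__":
    clip = [60, 62, 64, 65, 67]  # C major fragment as MIDI note numbers
    print(mask_pitch_sequence(clip, mask_prob=0.5, seed=1))
    print(transpose(clip, 2))
```

A model trained to recover the masked pitches of AI-composed clips could then score an unseen clip by how well it predicts that clip's masked notes; how the paper turns this into a probability is not stated in the abstract.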

Similar resources

Transformer fault diagnosis using continuous sparse autoencoder.

This paper proposes a novel continuous sparse autoencoder (CSAE) that can be used in unsupervised feature learning. The CSAE adds a Gaussian stochastic unit to the activation function to extract features of nonlinear data. In this paper, the CSAE is applied to the problem of transformer fault recognition. Firstly, based on the dissolved gas analysis method, IEC three ratios are calculated by the con...

Pitch-to-MIDI conversion (Jari Salo)

There are several kinds of methods and underlying models for monophonic fundamental frequency estimation of periodic or semi-periodic signals. The methods operate in various domains (time, frequency, or joint time-frequency domains). A general structure and a few examples of different approaches, along with their strengths and weaknesses, are briefly described.

Recurrent Neural Network-Based Semantic Variational Autoencoder for Sequence-to-Sequence Learning

Sequence-to-sequence (Seq2seq) models have played an important role in the recent success of various natural language processing methods, such as machine translation, text summarization, and speech recognition. However, current Seq2seq models have trouble preserving global latent information from a long sequence of words. Variational autoencoder (VAE) alleviates this problem by learning a continuo...

Freepad: A Custom Paper-based MIDI Interface

The field of mixed-reality interface design is relatively young and, with regard to music, has not been explored in great depth. Using computer vision and collision detection techniques, Freepad further explores the development of mixed-reality interfaces for music. The result is an accessible user-definable MIDI interface for anyone with a webcam, pen and paper, which outputs MIDI notes with vel...

Autoencoder-based holographic image restoration

We propose a holographic image restoration method using an autoencoder, which is an artificial neural network. Because holographic reconstructed images are often contaminated by direct light, conjugate light, and speckle noise, the discrimination of reconstructed images may be difficult. In this paper, we demonstrate the restoration of reconstructed images from holograms that record page data i...

Journal

Journal title: Lecture Notes in Electrical Engineering

Year: 2021

ISSN: 1876-1100, 1876-1119

DOI: https://doi.org/10.1007/978-981-16-1649-5_17